Maximization of AUC and Buffered AUC in Classification
نویسندگان
چکیده
This paper utilizes a new concept, called Buffered Probability of Exceedance (bPOE), to introduce an alternative to the Area Under the Receiver Operating Characteristic Curve (AUC) performance metric called Buffered AUC (bAUC). Central to the creation of bAUC is a new technique for calculation and optimization of bPOE. We show this formula to be easily integrable into optimization frameworks, often reducing bPOE minimization to convex, sometimes even linear, programming. Then, we utilize bPOE to create the bAUC performance metric, showing it to be an intuitive counterpart to AUC. In addition, we show that bAUC is much easier to handle in optimization frameworks than AUC, specifically reducing to convex and linear programming. We use these friendly optimization properties to introduce the bAUC Efficiency Frontier, a concept that serves to partially resolve the “incoherency” that arises when misclassification costs need be considered. We conclude that bAUC avoids many of the numerically troublesome issues encountered by AUC and integrates much more smoothly into the general framework of model selection and evaluation.
منابع مشابه
Increasing the accuracy of the classification of diabetic patients in terms of functional limitation using linear and nonlinear combinations of biomarkers: Ramp AUC method
The Area under the ROC Curve (AUC) is a common index for evaluating the ability of the biomarkers for classification. In practice, a single biomarker has limited classification ability, so to improve the classification performance, we are interested in combining biomarkers linearly and nonlinearly. In this study, while introducing various types of loss functions, the Ramp AUC method and some of...
متن کاملAUC Maximization with K-hyperplane
The area under the ROC curve (AUC) is a measure of interest in various machine learning and data mining applications. It has been widely used to evaluate classification performance on heavily imbalanced data. The kernelized AUC maximization machines have established a superior generalization ability compared to linear AUC machines because of their capability in modeling the complex nonlinear st...
متن کاملEfficient AUC Maximization with Regularized Least-Squares
Area under the receiver operating characteristics curve (AUC) is a popular measure for evaluating the quality of binary classifiers, and intuitively, machine learning algorithms that maximize an approximation of AUC should have a good AUC performance when classifying new examples. However, designing such algorithms in the framework of kernel methods has proven to be challenging. In this paper, ...
متن کاملOnline AUC Maximization
Most studies of online learning measure the performance of a learner by classification accuracy, which is inappropriate for applications where the data are unevenly distributed among different classes. We address this limitation by developing online learning algorithm for maximizing Area Under the ROC curve (AUC), a metric that is widely used for measuring the classification performance for imb...
متن کاملStochastic Online AUC Maximization
Area under ROC (AUC) is a metric which is widely used for measuring the classification performance for imbalanced data. It is of theoretical and practical interest to develop online learning algorithms that maximizes AUC for large-scale data. A specific challenge in developing online AUC maximization algorithm is that the learning objective function is usually defined over a pair of training ex...
متن کامل